Minig Top-K High Utility Itemsets - Report

نویسندگان

  • Daniel Yu
  • Cheng Wei Wu
چکیده

Utility mining, which refers to the discovery of itemsets with utilities higher than a user-specified minimum utility threshold, is an important task and has a wide range of applications, especially in e-commerce. But setting an appropriate minimum utility threshold is a difficult problem. If the minimum threshold is set to low, too many high utility itemsets will be generated and it takes a long time to compute, while setting the minimum threshold too high would result in too few results. Setting appropriate minimum utility threshold by trial and error is not very efficient. We want to discuss in this report how this can be done better.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...

متن کامل

Mining top-k high utility patterns over data streams

Online high utility itemset mining over data streams has been studied recently. However, the existing methods are not designed for producing topk patterns. Since there could be a large number of high utility patterns, finding only top-k patterns is more attractive than producing all the patterns whose utility is above a threshold. A challenge with finding top-k high utility itemsets over data s...

متن کامل

High Utility Itemsets Mining – A Brief Explanation with a Proposal

High utility itemsets mining is relevant for business vendors. So that they can give more offers to high utility itemsets. To understand the above sentence we need to know what is high utility itemsets. High utility itemsets are those ones that yield high profit when sold together or alone that meets a user-specified minimum utility threshold from a transactional database. This high utility ite...

متن کامل

An Algorithm of Top-k High Utility Itemsets Mining over Data Stream

Existing top-k high utility itemset (HUI) mining algorithms generate candidate itemsets in the mining process; their time & space performance might be severely affected when the dataset is large or contains many long transactions; and when applied to data streams, the performance of corresponding mining algorithm is especially crucial. To address this issue, propose a sliding window based top-k...

متن کامل

Mining Long High Utility Itemsets in Transaction Databases

Although support has been used as a fundamental measure to determine the statistical importance of an itemset, it can’t express other richer information such as quantity sold, unit profit, or other numerical attributes. To overcome the shortcoming, utility is used to measure the semantic importance and several algorithms for utility mining have been proposed. However, existing algorithms for ut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015